Field of the technology 

This invention pertains to a novel PGTH protein of human brain origin having a 
prostaglandin transport activity and the pgth gene encoding the protein. 

Prior art 

Prostaglandin is a generic name for a series of physiologically active lipids such as 
prostaglandin E, prostaglandin D, prostaglandin F, prostaglandin I, prostaglandin J, etc. 
Acting through specific cell membrane or intranuclear receptors, prostaglandin is strongly 
involved in the control of physiological functions such as blood flow, sleep, protection of the 
gastric mucosa, thrombus formation, pregnancy, etc. 

Prostaglandin is produced inside cells in response to various physiological stimuli: 
eicosapolyenoic acids such as arachidonic acid are released from the cell membrane by 
phospholipase A2 and converted by cyclooxygenase and various prostaglandin synthetic 
enzymes. After being released outside the cells, prostaglandin has autocrine or paracrine 
effects. Liberated prostaglandin also circulates in the blood, is taken up by specific cells, 
and is metabolized and eliminated. 

A trace amount of prostaglandin shows a strong physiological activity, and consequently, 
the production of prostaglandin compounds is strictly controlled by controlling the activity of 
production-related and metabolism-related enzymes. 

However, prostaglandin has been reported to be unable to pass through the lipid bilayer 
of the cell membrane by itself. Therefore, a special protein has been presumed to serve as a 
prostaglandin transport mechanism, both when prostaglandin produced inside a cell exits the 
cell and when prostaglandin circulating in the blood is taken up into a specific cell. 

As a protein involved in the transport mechanism described above, a prostaglandin transporter 
(abbreviated hPGT, human prostaglandin transporter, below) has been reported, but it is not 
involved in the transport of all prostaglandin compounds, and many points remain unclear. 
Consequently, if a biological molecule other than hPGT that is involved in the transport 
mechanism can be identified, that molecule might be usable directly as a medical drug or 
indirectly as a tool for studying compounds that might be usable as medical drugs. Therefore, 
the objective of this invention is to identify such a molecule and use it as a medical drug or 
for the development of medical drugs. 

Presentation of the invention 

The inventors of this invention studied diligently to find the desired protein by using 
genes expressed in the human brain. As a result, they found a novel PGTH protein 
(prostaglandin transporter homologue), successfully isolated the pgth gene encoding the 
protein, and thus arrived at this invention. 

Specifically, this invention pertains to (a) a protein having the amino acid sequence 
described in sequence No. 1 or (b) a protein having an amino acid sequence with 1 to several 
amino acids deleted, substituted or added to the amino acid sequence of sequence No. 1, and 
having a prostaglandin transport activity. 

Furthermore, this invention also pertains to (c) a gene comprising the DNA described in 
sequence No. 2 or (d) DNA which can be hybridized with the DNA of sequence No. 2 under 
stringent conditions and which encodes a protein having a prostaglandin transport activity. 

The pgth gene of this invention can be isolated as a cDNA fragment containing the gene 
from a cDNA library of human brain origin. The cDNA library used by the inventors of this 
invention was prepared from commercially available mRNA of human brain origin from 
Clontech Co. 

To identify the cDNA encoding a protein having a prostaglandin transport activity in the 
cDNA library described above, the method of Ohara, et al. (DNA Research 4: p 53, 1997), an 
extensive cDNA library analysis method using a long-chain cDNA library, was used. From a 
long-chain cDNA library of human brain origin prepared by the method of Ohara, et al., 
25,000 recombinants were randomly selected, and the 5'- and 3'-base sequences of the cDNAs 
from 15,000 clones were determined. A clone showing homology to the already reported gene 
encoding hPGT can then be found by searching the 5' sequences of all the clones with DNA 
analysis programs (BLAST and FastA). 

The presence of a region encoding the protein (ORF: open reading frame) in the base 
sequence can be confirmed by a conventional method using a computer program. After 
becoming confident of the presence of the desired gene in the cDNA sequence, the inventors of 
this invention found one ORF in the sequence by utilizing a computer, the gene was named pgth, 
and the protein encoded by the gene was named PGTH. The PGTH of the invention is a protein 
comprising a total of 709 amino acid residues and having a molecular weight of about 80 kd. 
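
The stated sizes can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original procedure. It assumes that the 2130 bp given for sequence No. 2 includes the termination codon, and it uses a rule-of-thumb average residue mass of 110 Da, which is an assumption introduced here for illustration only.

```python
# Rough consistency check between residue count, ORF length and protein mass.
# AVG_RESIDUE_MASS_DA is a rule-of-thumb assumption, not a value from the text.
AVG_RESIDUE_MASS_DA = 110.0


def orf_length_bp(n_residues: int, include_stop: bool = True) -> int:
    """Length in bp of an ORF encoding n_residues, optionally with a stop codon."""
    return 3 * (n_residues + (1 if include_stop else 0))


def approx_mass_kda(n_residues: int) -> float:
    """Approximate molecular mass in kDa from the residue count."""
    return n_residues * AVG_RESIDUE_MASS_DA / 1000.0


if __name__ == "__main__":
    print(orf_length_bp(709))        # 709 codons plus one stop codon
    print(approx_mass_kda(709))      # approximate mass in kDa
```

Under these assumptions, 709 residues plus a termination codon give 2130 bp, and the estimated mass is close to the value of about 80 kd stated above.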

The pgth of this invention is a gene comprising the 2130 bp shown in sequence No. 2. By 
using this pgth and conventional genetic recombination techniques with a suitable host-vector 
system, it is possible to prepare a recombinant gene. Suitable vectors include plasmids of 
E. coli origin (such as pBR322, pUC118, etc.), plasmids of Bacillus subtilis origin (such as 
pUB110, pC194, etc.), plasmids of yeast origin (such as pSH19, etc.), bacteriophages, and 
animal viruses such as retroviruses, vaccinia virus, etc. At the time of recombination, it is 
possible to add translation initiation and termination codons using suitable DNA adaptors. 
Furthermore, for gene expression, a suitable expression promoter is attached upstream of the 
gene. The promoter to be used is suitably selected depending on the host used. For example, 
if the host is E. coli, there are T7, lac, trp, λPL promoters, etc.; if the host is a Bacillus, 
there are SPO promoters, etc.; if the host is a yeast, there are PHO5, GAP, ADH promoters, 
etc.; and if the host is an animal cell, there are SV40-origin, retrovirus promoters, etc. 

Furthermore, the gene may be expressed as a fused protein with another protein (such as 
glutathione-S-transferase, protein A, etc.). In the case of a fused PGTH prepared by using such a 
method, a suitable protease (such as thrombin, etc.), may be used to cut out the protein. 

As a host usable in the case of PGTH expression, there are various strains of Escherichia 
coli, various strains of Bacillus subtilis, various strains of the yeast Saccharomyces cerevisiae 
and animal cells such as COS-7, CHO cells, etc. 

As a method for transforming a host cell using the above recombinant vector, a specific 
method conventionally used to transform the selected host cell is used. 

Incidentally, in this invention, DNA which has a DNA sequence other than that shown in 
sequence No. 2 which can be hybridized with the DNA and encodes a protein having a 
prostaglandin transport activity, is also included in the scope of this invention. 

Specifically, DNA in which the full-length pgth sequence has been partially changed by 
various artificial treatments, such as random mutagenesis, introduction of site-specific 
mutations, mutagen treatment, or mutation, deletion or ligation of DNA fragments after 
scission with restriction enzymes, is also included in the scope of this invention even though 
its DNA sequence differs from that of sequence No. 2, as long as the DNA variant can be 
hybridized with pgth under stringent conditions and encodes a protein having a prostaglandin 
transport activity. 

The extent of the above DNA mutation is within the allowable range if the variant has 
90% or higher homology with the DNA sequence of pgth. Furthermore, as for the extent of 
hybridization with pgth, Southern hybridization with pgth may be carried out under conventional 
conditions; for example, when the probe is labeled with a DIG DNA Labeling kit 
(Boehringer-Mannheim Cat. No. 1175033), hybridization is carried out in a DIG Easy Hyb solution 
(Boehringer-Mannheim Cat. No. 1603558) at 32°C, and the membrane is washed in a 5X SSC 
solution (containing 0.1% w/v SDS) at 50°C (1X SSC comprises 0.15 M NaCl and 
0.015 M sodium citrate). 
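
As an illustration of the buffer definition above, the salt concentrations of any SSC strength follow by scaling the 1X composition. The sketch below is a hedged example; the function name and the dictionary layout are hypothetical and serve only this illustration.

```python
# 1X SSC composition as stated in the text (molar concentrations).
SSC_1X_MOLAR = {"NaCl": 0.15, "sodium citrate": 0.015}


def ssc_concentrations(fold: float) -> dict:
    """Scale the 1X SSC composition to the requested strength (e.g. 5 for 5X)."""
    if fold <= 0:
        raise ValueError("fold must be positive")
    return {salt: round(conc * fold, 4) for salt, conc in SSC_1X_MOLAR.items()}


if __name__ == "__main__":
    # Wash solution named in the text: 5X SSC
    print(ssc_concentrations(5))
```

For the 5X SSC wash solution this corresponds to 0.75 M NaCl and 0.075 M sodium citrate.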

Furthermore, a protein encoded by the gene variant which is highly homologous to pgth 
as described above and has a prostaglandin transport activity is also included in the scope of this 
invention. 

Specifically, a variant having one or more amino acids deleted, substituted or added to 
the amino acid sequence of PGTH is included in the scope of this invention as long as this 
variant is a protein having a prostaglandin transport activity. 

The side chains of the amino acids that constitute proteins differ from one another in 
hydrophobicity, electrical charge, size, etc., but several relationships are known, from 
experience or from actual physicochemical observations, to be highly conservative in the 
sense that they practically do not affect the three-dimensional structure (also called the 
steric structure) of proteins. Examples of such substitutions of amino acid residues are 
glycine (Gly) and proline (Pro), Gly and alanine (Ala) or valine (Val), leucine (Leu) and 
isoleucine (Ile), glutamic acid (Glu) and glutamine (Gln), aspartic acid (Asp) and asparagine 
(Asn), cysteine (Cys) and threonine (Thr), Thr and serine (Ser) or Ala, lysine (Lys) and 
arginine (Arg), etc. 

Therefore, any variant protein due to substitution, insertion, deletion, etc., in the amino 
acid sequence of the PGTH shown in sequence No. 1 can be said to be within the scope of this 
invention if the variation is a variation which conserves the three-dimensional structure of the 
PGTH, and the protein is a protein having a prostaglandin transport activity similar to PGTH. 
The allowable extent of this variation is 90% or higher homology with the amino acid sequence 
shown in sequence No. 1. 
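
The 90% criterion can be illustrated with a short calculation. The sketch below is only one possible reading: it assumes that "homology" is measured as percent identity over an alignment supplied by the user (the alignment step itself is not shown), and that a gap against a residue counts as a mismatch. The reference fragment is the first 20 residues of PGTH as read from the sequence listing; the two variants are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity between two pre-aligned sequences ('-' marks a gap)."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Ignore columns where both sequences have a gap.
    columns = [(a, b) for a, b in zip(aligned_a.upper(), aligned_b.upper())
               if not (a == "-" and b == "-")]
    if not columns:
        raise ValueError("no aligned positions")
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float = 90.0) -> bool:
    """True if percent identity is at or above the threshold (90% in the text)."""
    return percent_identity(aligned_a, aligned_b) >= threshold


# First 20 residues of PGTH (one-letter code), read from the sequence listing.
REFERENCE = "MGPRIGPAGEVPQVPDKETK"
VARIANT_ONE = "MGPRLGPAGEVPQVPDKETK"    # hypothetical: Ile5 -> Leu
VARIANT_THREE = "MGPRLGPAGDVPQVPDRETK"  # hypothetical: three substitutions

if __name__ == "__main__":
    print(percent_identity(REFERENCE, VARIANT_ONE), meets_threshold(REFERENCE, VARIANT_ONE))
    print(percent_identity(REFERENCE, VARIANT_THREE), meets_threshold(REFERENCE, VARIANT_THREE))
```

In this toy fragment a single substitution leaves 95% identity, while three substitutions leave 85%, which would fall below the threshold. Over the full 709 residues the same calculation applies to the whole alignment.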

Industrial application field 

Because PGTH has a prostaglandin transport activity, abnormal expression of pgth or 
functional failure of PGTH is presumed to cause critical disorders, since the normal 
prostaglandin production mechanism of the body would be lost. 

Therefore, PGTH itself is considered to be useful as a drug, and on the other hand, pgth 
or PGTH may be used for effectively studying or evaluating a substance having the same 
function as that of PGTH, a substance promoting or inhibiting its function, a substance 
promoting the expression of the gene, etc. 

Best embodiment of the present invention 

This invention is explained further in detail using application examples as follows, but 
this invention is certainly not limited to these application examples. Incidentally, unless 
specified, the experimental procedures used in the following application examples are those 
described in standard experimental manuals such as Molecular Cloning, 2nd ed. (Cold Spring 
Harbor Laboratory Press, 1989), etc., and the operating manuals in commercially available kits, 
and they can be carried out under the conditions recommended for the respective commercially 
available products such as restriction enzymes, etc. 

Application Example 1 Cloning of pgth 

1) Construction of a long chain cDNA library of human brain origin 

An oligonucleotide (GACTAGTTCTAGATCGCGAGCGGCCGCCC(T)15) containing a 
NotI site was synthesized using a DNA synthesizer (ABI380B). It was used as a primer, and 
double-stranded cDNA was synthesized using mRNA of human brain origin as a template and 
the SuperScript II reverse transcriptase kit (Gibco BRL). The cDNA was ligated with a 
SalI site-containing adapter (Takara Shuzo) and subsequently digested with NotI, and cDNA 
fragments of 3 kb or larger were purified by electrophoresis on 1% low-melting agarose. 

After ligation of the purified cDNA fragments with a SalI-NotI restriction enzyme-treated 
pBluescriptIISK+ plasmid, the recombinant plasmids were introduced into E. coli ElectroMax 
DH10B strain (Gibco BRL) using the electroporation method. Subsequently, 25,000 
recombinants were randomly selected from the library, the recombinant DNAs were extracted, 
and the 5'- and 3'-base sequences of the cDNAs of 15,000 clones were determined. For the 
sequence determination, a PE Applied Biosystem Co., DNA sequencer (ABI PRISM377) and the 
reaction kit from the same company were used. 

2) Selection of clones containing the pgth sequence 

The 5' sequences of all the clones determined in 1) were compared with the sequence of 
hPGT already reported using DNA analytical programs (BLAST and FastA), and as a result, a 
clone named HK07457 showed significant homology. 

3) DNA fragment base sequence determination 

The base sequence determination was carried out using a PE Applied Biosystem Co. 
DNA sequencer and the dye primer method. Most of the sequence was determined using the 
shotgun method; for the remaining portion, oligonucleotides were synthesized based on the 
base sequence already determined, and the primer walking method was used to determine the 
entire base sequence of both strands. The entire base sequence of the cDNA of the clone is 
shown in sequence No. 3. 






The cDNA contains an ORF encoding a protein (PGTH) comprising 709 residues. A 
termination codon was found upstream of, and in the same reading frame as, the methionine 
codon that serves as the initiation codon of the protein. Therefore, the amino acid 
sequence shown in sequence No. 3 was confirmed to be the only possible amino acid 
sequence of the protein encoded by the cDNA fragment. 

Figure 1 shows the amino acid homology between the already reported hPGT and the PGTH 
of this invention. The two show high homology; in particular, the position of the cysteine 
residue present at the C-terminal of PGTH is conserved, and the 77th residue glutamine, 
561st residue arginine and 614th residue lysine of hPGT, which are amino acids especially 
important for the transport activity, are also conserved in PGTH. 

Application Example 2 

Confirmation of protein expression by in vitro translation of pgth 

The plasmid containing pgth prepared in Application Example 1 was treated with RNase 
A, the RNase A was subsequently removed using ADVAMAX beads (AGTC Co.), and in vitro 
translation was carried out using a TNT T7 coupled reticulocyte lysate system (Promega Co.) in 
the presence of [35S]-methionine. A portion of the reaction mixture was separated by 
SDS-polyacrylamide gel electrophoresis (SDS-PAGE) and analyzed using a 
BAS-2000 (Fuji Shashin Kogyo). As a result, the presence of a single band at about 80 kd was 
confirmed, as shown in Figure 2. 

Application Example 3 

Construction of animal cell expression vector 

1) Amplification of ORF-containing cDNA 

An oligonucleotide (Sequence 1 below) having a sequence upstream of the 
initiation codon of the protein in sequence No. 3 and an oligonucleotide (Sequence 2 below) 
having the reverse complementary sequence of a portion downstream of the termination codon 
of the protein were synthesized using a DNA synthesizer (ABI Co., Model 380B). 



Sequence 1 

5'-CTGGAGCTCACTGCACTCCAGCAGTC-3' 

Sequence 2 

5'-AGCTCACACTCGGGAATCCTCTGGCTTC-3' 

The recombinant cDNA containing sequence No. 3 isolated in Application Example 1 was 
used as a template, the oligonucleotides of Sequences 1 and 2 were used as primers, and the 
following PCR procedures were carried out using a Takara LA PCR kit Ver. 2 and the PCR 
thermal cycler MP (Takara Shuzo). 

cDNA                                        5 µL (10 ng) 
10X PCR buffer (containing 25 mM Mg2+)      5 µL 
2.5 mM dNTP                                 8 µL 
10 µM Sequence 1                            2 µL 
10 µM Sequence 2                            2 µL 
Water                                       27.5 µL 
LA Taq polymerase                           0.5 µL 
Total amount                                50 µL 
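
The composition above can be checked by summing the component volumes and computing final concentrations from the stock concentrations. The sketch below is illustrative only; it assumes that all listed volumes are in microliters and that the 25 mM Mg2+ refers to the concentration in the 10X buffer stock. The helper name `final_concentration` is hypothetical.

```python
# Component volumes in microliters, as listed in the reaction composition.
MIX_UL = {
    "cDNA": 5.0,
    "10X PCR buffer": 5.0,
    "2.5 mM dNTP": 8.0,
    "10 uM Sequence 1": 2.0,
    "10 uM Sequence 2": 2.0,
    "water": 27.5,
    "LA Taq polymerase": 0.5,
}


def final_concentration(stock: float, volume_ul: float, total_ul: float) -> float:
    """Final concentration after dilution; same unit as the stock concentration."""
    return stock * volume_ul / total_ul


if __name__ == "__main__":
    total = sum(MIX_UL.values())
    print("total volume (uL):", total)
    print("dNTP (mM):", final_concentration(2.5, MIX_UL["2.5 mM dNTP"], total))
    print("each primer (uM):", final_concentration(10.0, MIX_UL["10 uM Sequence 1"], total))
    # Assumption: 25 mM Mg2+ is the concentration in the 10X buffer stock.
    print("Mg2+ (mM):", final_concentration(25.0, MIX_UL["10X PCR buffer"], total))
```

The volumes add up to the stated total of 50 µL, and under the stated assumptions the reaction contains 0.4 mM dNTP, 0.4 µM of each primer and 2.5 mM Mg2+.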

The PCR cycle was carried out by holding at 94°C for 2 min, carrying out the reaction at 
98°C for 20 sec, cooling to 68°C at a rate of 1°C/2 sec, holding at 68°C for 3 min, at 72°C for 10 
min, and repeating 30 times. 

The above method was used to amplify a DNA fragment (about 2.2 kb) having a portion 
of sequence No. 3. 
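
The size of the amplified fragment can be estimated from the primers themselves. The sketch below makes two assumptions that should be checked against sequence No. 3: that Sequence 1 lies immediately upstream of the initiation codon (its last base at position 266), and that the TGA found near the 3' end of the sense-strand reading of Sequence 2 is the termination codon of the 2130 bp ORF. The function and variable names are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


SEQUENCE_1 = "CTGGAGCTCACTGCACTCCAGCAGTC"    # forward primer from the text
SEQUENCE_2 = "AGCTCACACTCGGGAATCCTCTGGCTTC"  # reverse primer from the text
ORF_LENGTH_BP = 2130                         # length given for sequence No. 2

# Sense-strand reading of the reverse primer.
sense_of_sequence_2 = reverse_complement(SEQUENCE_2)

# Bases of the primer lying 3' of the (assumed) termination codon TGA.
bases_after_stop = len(sense_of_sequence_2) - (sense_of_sequence_2.rfind("TGA") + 3)

# Assumption: Sequence 1 ends directly before the ATG, so the amplicon is
# forward primer + ORF + bases of the reverse primer beyond the stop codon.
expected_amplicon_bp = len(SEQUENCE_1) + ORF_LENGTH_BP + bases_after_stop

if __name__ == "__main__":
    print(sense_of_sequence_2)
    print(expected_amplicon_bp)
```

Under these assumptions the expected product is 2159 bp, in agreement with the fragment of about 2.2 kb described above.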

2) Subcloning to an animal cell expression vector 

The DNA fragment amplified in 1) was fractionated by 1% agarose gel electrophoresis. 
After staining the gel with ethidium bromide, the gel containing the desired band observed under 
ultraviolet irradiation was cut out. The extraction of the DNA fragment from the agarose gel and 
purification were carried out using a GENECLEAN II Kit (Bio 101 Co.). 

The extracted and purified DNA fragment was subcloned into the animal cell expression vector 
pTARGET (Promega Co.). The ligation solution used was that of a Takara Ligation Kit Ver. 2 
(Takara Shuzo), and the reaction was carried out with the following composition at 16°C for 1.5 h. 

Extracted and purified DNA fragment         1 µL (50 ng) 
pTARGET                                     1 µL (10 ng) 
Water                                       3 µL 
Ligation solution                           5 µL 
Total                                       10 µL 

The reaction solution after the above reaction was used to transform the E. coli K12 strain 
DH5. The transformant was inoculated on an LB agar medium containing 50 µg/mL of 
ampicillin (Amp), 40 µg/mL of 5-bromo-4-chloro-3-indolyl-β-D-galactoside (IPTG) [sic; 
isopropyl-β-D-thiogalactopyranoside] and 100 µM of isopropyl-β-D-thiogalactopyranoside 
(X-gal) [sic; 5-bromo-4-chloro-3-indolyl-β-D-galactopyranoside]* and cultivated overnight at 
37°C. 

Each colony that developed on the above plate was inoculated in 10 mL of an LB liquid 
medium containing 50 µg/mL of Amp, cultivation was carried out overnight at 37°C, the 
cells were collected by centrifugation, and subsequently the recombinant DNA was purified 
using a QIAprep Spin Plasmid Miniprep Kit (Qiagen Co.) to obtain pTARGETpgth. 

3) Determination of the base sequence of the inserted cDNA 

The base sequence determination was carried out using a DNA sequencer (ABI Co., 
Model PRISM377) and the dye terminator method, and the whole base sequence of both 
strands was determined using the primer walking method. The clone was found to contain all of 
the region of sequence No. 3 between Sequences 1 and 2, confirming that the desired 
pTARGETpgth had been cloned. 

Application Example 4 

Insertion into CHO-K1 cells and stable transformant preparation 

The recombinant DNA, pTARGETpgth, prepared in Application Example 3 has a CMV 
promoter upstream of pgth, so that pgth can be expressed when it is inserted into an animal 
cell. 

CHO-K1 cells were cultured in 60 mm diameter plastic Petri dishes. As the culture 
medium, Ham F-12 (Gibco, called growth medium, below) containing 10% fetal bovine serum 
(Dainippon Seiyaku), 50 U/mL of penicillin and 50 µg/mL of streptomycin was used, and culture 
was carried out at 37°C in the presence of 5% CO2. When the cell density reached 50%, 
LIPOFECTAMINE reagent (Gibco) containing the pTARGETpgth prepared in Application Example 
3 was added in a layer over the cells and incubated for 6 h, and, after replacement with the growth 
medium, culture was continued for 48 h. After dispersing the cells with trypsin, the cell 
suspension was placed in a 60 mm diameter plastic Petri dish, and culture was carried out for 24 
h. After removing the culture medium, it was replaced by growth medium containing G418 
(Gibco, final concentration of 500 µg/mL). The G418 medium was changed every 3 days and 
culture continued for 2 weeks. When the cell colonies were observable with the naked eye, 3 
colonies were isolated using stainless steel cups. As a control, only the pTARGET vector 
(Promega Co.) was inserted into CHO-K1 cells by carrying out the same procedures as those 
described above to isolate a stable transformant. 



* [Editor's note: The compound names and abbreviations are so garbled in the original text that it is impossible 
to be certain whether it should be 40 µg/mL IPTG and 100 µM X-gal, or vice versa.] 

washed with a suitable buffer solution containing bovine serum albumin, and culture was 
continued for 20 min using a buffer solution containing [3H]-labeled PGE2 (Amersham Co.). 
After washing the cells, they were recovered, and the radioactivity taken up was measured. As a 
result, the prostaglandin transport activity of the CHO-K1 cells with pgth inserted was statistically 
significantly higher than that of the CHO-K1 cells with only the control vector inserted. 

Application Example 6 

Expression of pgth mRNA in human macrophages loaded with oxidized LDL 

1) Preparation of human macrophages loaded with oxidized LDL and normal monocyte cDNA 

Normal monocyte cDNA was prepared using, as a template, RNA prepared with Trizol (Gibco BRL 
Co.) from CD14-positive monocytes from human peripheral blood, and the 
SuperScript II reverse transcriptase kit (Gibco BRL). Human macrophages loaded with oxidized 
LDL were prepared by culturing normal monocytes in RPMI-1640 medium (Dainippon 
Seiyaku) containing 20% AB serum and antibiotics for 14 days, adding human LDL oxidized 
with copper sulfate using conventional procedures (oxidized LDL) to a final concentration of 
40 µ/mL [sic; dimension incorrect] and continuing culture for 24 h. A method similar to that 
used for normal monocytes was used to prepare cDNA. 

2) Confirmation of pgth mRNA expression by the RT-PCR method 

Oligonucleotides (Sequence 3 below) having a sequence contained in sequence No. 2 and 
oligonucleotides (Sequence 4 below) having the sequence of the reverse complementary 
strand were respectively synthesized using a DNA synthesizer (ABI Co., Model 380B). 

Sequence 3 

5'-GCTCCTGCCCATTGGACGGCTTTAACC-3' 

Sequence 4 

5'-TCACACTCGGGAATCCTCTGGCTTC-3' 

The cDNA prepared in 1) was used as a template, the oligonucleotides of Sequences 3 and 4 
were used as primers, and the following PCR procedures were carried out using a Takara LA 
PCR kit Ver. 2 and the PCR thermal cycler MP (Takara Shuzo). 



cDNA                                        2 µL (40 ng) 
10X PCR buffer (containing 25 mM Mg2+)      1.5 µL 
2.5 mM dNTP                                 2.4 µL 
10 µM Sequence 3                            0.4 µL 
10 µM Sequence 4                            0.4 µL 
Water                                       10.15 µL 
LA Taq polymerase                           0.15 µL 
Total amount                                15 µL 



The PCR cycle was carried out by holding at 94°C for 5 min, carrying out the reaction at 
94°C for 1 min, holding at 58°C for 1 min, then at 72°C for 1 min, and repeating 30 
times. The PCR reaction mixture was fractionated using 1% agarose gel electrophoresis. After 
staining the gel with ethidium bromide, the gel was irradiated with ultraviolet light to detect an 
amplified band at about 500 bp. Similarly, amplification primers for the glyceraldehyde 
3-phosphate dehydrogenase gene (G3PDH, Clontech Co.) were used as a standard for the cDNA 
in the PCR test. As a result, the expression of pgth mRNA was strongly induced in the 
macrophages loaded with oxidized LDL, as shown in Figure 3. 

Normal monocytes, macrophages loaded with oxidized LDL or equivalent cultured cells 
may be cultured with a test compound added, and subsequently the change in the PGTH mRNA 
may be measured by the method described above to screen any substance controlling PGTH 
mRNA expression. 

Brief description of the figures 

Figure 1 shows comparison of amino acid sequence homology between hPGT and the 
PGTH of this invention. 

Figure 2 shows the results of SDS-PAGE of PGTH expressed using the in vitro 
translation method using pgth. 

Figure 3 shows the results of detection of mRNA for the expression of pgth in human 
macrophages loaded with oxidized LDL using the RT-PCR method. In the figure, "o" shows the 
results for human macrophages loaded with oxidized LDL, and "m" shows the results for normal 
human monocytes. 

Claims 

(1) A protein of the following (a) or (b). 

(a) Protein comprising the amino acid sequence of sequence No. 1 

(b) Protein comprising an amino acid sequence with one or more amino acids deleted, 
substituted or added to the amino acid sequence of sequence No. 1, and, at the same time, having 
a prostaglandin transport activity. 

(2) DNA of the following (a) or (b). 

(a) DNA comprising the base sequence of sequence No. 2 

(b) DNA which can be hybridized with the DNA of sequence No. 2 under stringent 
conditions and at the same time, encodes a protein having a prostaglandin transport activity. 

<210> 1 
<211> 709 
<212> PRT 

Gly Ser Ala Leu Ala Cys Leu Thr His Thr Pro Ser Phe Met Leu
                575                 580                 585

Ile Leu Arg Gly Val Lys Lys Glu Asp Lys Thr Leu Ala Val Gly
                590                 595                 600

Ile Gln Phe Met Phe Leu Arg Ile Leu Ala Trp Met Pro Ser Pro
                605                 610                 615

Val Ile His Gly Ser Ala Ile Asp Thr Thr Cys Val His Trp Ala
                620                 625                 630

Leu Ser Cys Gly Arg Arg Ala Val Cys Arg Tyr Tyr Asn Asn Asp
                635                 640                 645

Leu Leu Arg Asn Arg Phe Ile Gly Leu Gln Phe Phe Phe Lys Thr
                650                 655                 660

Gly Ser Val Ile Cys Phe Ala Leu Val Leu Ala Val Leu Arg Gln
                665                 670                 675

Gln Asp Lys Glu Ala Arg Thr Lys Glu Ser Arg Ser Ser Pro Ala
                680                 685                 690

Val Glu Gln Gln Leu Leu Val Ser Gly Pro Gly Lys Lys Pro Glu
                695                 700                 705

Asp Ser Arg Val
            709



<210> 2 
<211> 2130 
<212> DNA 
<213> Homo sapiens 
<400> 2 

10 20 30 40 50 60 

atgggaccca ggatagggcc agcgggtgag gtaccccagg taccagacaa ggaaaccaaa 60
gccacaatgg gcacagaaaa cacacctgga ggcaaagcca gcccagaccc tcaggacgtg 120
cggccaagtg tgttccataa catcaagctg ttcgttctgt gccacagcct gctgcagctg 180
gcgcagctca tgatctccgg ctacctaaag agctccatct ccacagtgga gaagcgcttc 240
ggcctctcca gccagacgtc ggggctgctg gcctccttca acgaggtggg gaacacagcc 300
ttgattgtgt ttgtgagcta ttttggcagc cgggtgcacc gaccccgaat gattggctat 360
ggggctatcc ttgtggccct ggcgggcctg ctcatgactc tcccgcactt catctcggag 420
ccataccgct acgacaacac cagccctgag gatatgccac aggacttcaa ggcttccctg 480
tgcctgccca caacctcggc cccagcctcg gccccctcca atggcaactg ctcaagctac 540
acagaaaccc agcatctgag tgtggtgggg atcatgttcg tggcacagac cctgctgggc 600
gtgggcgggg tgcccattca gccctttggc atctcctaca tcgatgactt tgcccacaac 660
agcaactcgc ccctctacct cgggatcctg tttgcagtga ccatgatggg gccaggcctg 720
gcctttgggc tgggcagcct catgctgcgc ctttatgtgg acattaacca gatgccagaa 780
ggtggtatca gcctgaccat aaaggacccc cgatgggtgg gtgcctggtg gctgggtttc 840
ctcatcgctg ccggtgcagt ggccctggct gccatcccct acttcttctt ccccaaggaa 900
atgcccaagg aaaaacgtga gcttcagttt cggcgaaagg tcttagcagt cacagactca 960
cctgccagga agggcaagga ctctccctct aagcagagcc ctggggagtc cacgaagaag 1020
caggatggcc tagtccagat tgcaccaaac ctgactgtga tccagttcat taaagtcttc 1080
cccagggtgc tgctgcagac cctacgccac cccatcttcc tgctggtggt cctgtcccag 1140
gtatgcttgt catccatggc tgcgggcatg gccaccttcc tgcccaagtt cctggagcgc 1200
cagttttcca tcacagcctc ctacgccaac ctgctcatcg gctgcctctc cttcccttcg 1260
gtcatcgtgg gcatcgtggt gggtggcgtc ctggtcaagc ggctccacct gggccctgtg 1320
ggatgcggtg ccctttgcct gctggggatg ctgctgtgcc tcttcttcag cctgccgctc 1380
ttctttatcg gctgctccag ccaccagatt gcgggcatca cacaccagac cagtgcccac 1440
cctgggctgg agctgtctcc aagctgcatg gaggcctgct cctgcccatt ggacggcttt 1500
aaccctgtct gcgaccccag cactcgtgtg gaatacatca caccctgcca cgcaggctgc 1560
tcaagctggg tggtccagga tgctctggac aacagccagg ttttctacac caactgcagc 1620
tgcgtggtgg agggcaaccc cgtgctggca ggatcctgcg actcaacgtg cagccatctg 1680
gtggtgccct tcctgctcct ggtcagcctg ggctcggccc tggcctgtct cacccacaca 1740
ccctccttca tgctcatcct aagaggagtg aagaaagaag acaagacttt ggctgtgggc 1800
atccagttca tgttcctgag gattttggcc tggatgccca gccccgtgat ccacggcagc 1860
gccatcgaca ccacctgtgt gcactgggcc ctgagctgtg ggcgtcgagc tgtctgtcgc 1920
tactacaata atgacctgct ccgaaaccgg ttcatcggcc tccagttctt cttcaaaaca 1980
ggttctgtga tctgcttcgc cttagttttg gctgtcctga ggcagcagga caaagaggca 2040
aggaccaaag agagcagatc cagccctgcc gtagagcagc aattgctagt gtcggggcca 2100

<210> 3 
<211> 4083 
<212> DNA 
<213> Homo sapiens 
<400> 3 

aagtgaccca gggagacaaa cacttggaga tacttggggc tgagtttgag caagactccc 60
taacctgtgt ctggacaagt ctgatgtcct gtgtggccca agaagaactg accccgtgtc 120
tggagctccc accgttattg catccctgct gtggctcacc tgctgctgtc tccaggagcc 180
cctgagaaga tttgcctcct ctcccctgct aagctccagg tcctgagatt gaattagggg 240
ctggagctca ctgcactcca gcagtc 266

atg gga ccc agg ata ggg cca gcg ggt gag gta ccc cag gta cca  311
Met Gly Pro Arg Ile Gly Pro Ala Gly Glu Val Pro Gln Val Pro
                  5                  10                  15

gac aag gaa acc aaa gcc aca atg ggc aca gaa aac aca cct gga  356
Asp Lys Glu Thr Lys Ala Thr Met Gly Thr Glu Asn Thr Pro Gly
                 20                  25                  30

ggc aaa gcc agc cca gac cct cag gac gtg cgg cca agt gtg ttc  401
Gly Lys Ala Ser Pro Asp Pro Gln Asp Val Arg Pro Ser Val Phe
                 35                  40                  45

cat aac atc aag ctg ttc gtt ctg tgc cac agc ctg ctg cag ctg  446
His Asn Ile Lys Leu Phe Val Leu Cys His Ser Leu Leu Gln Leu
                 50                  55                  60

gcg cag ctc atg atc tcc ggc tac cta aag agc tcc atc tcc aca  491
Ala Gln Leu Met Ile Ser Gly Tyr Leu Lys Ser Ser Ile Ser Thr
                 65                  70                  75

gtg gag aag cgc ttc ggc ctc tcc agc cag acg tcg ggg ctg ctg  536
Val Glu Lys Arg Phe Gly Leu Ser Ser Gln Thr Ser Gly Leu Leu
                 80                  85                  90

gcc tcc ttc aac gag gtg ggg aac aca gcc ttg att gtg ttt gtg  581
Ala Ser Phe Asn Glu Val Gly Asn Thr Ala Leu Ile Val Phe Val
                 95                 100                 105

agc tat ttt ggc agc cgg gtg cac cga ccc cga atg att ggc tat  626
Ser Tyr Phe Gly Ser Arg Val His Arg Pro Arg Met Ile Gly Tyr
                110                 115                 120

ggg gct atc ctt gtg gcc ctg gcg ggc ctg ctc atg act ctc ccg  671
Gly Ala Ile Leu Val Ala Leu Ala Gly Leu Leu Met Thr Leu Pro
                125                 130                 135

cac ttc atc tcg gag cca tac cgc tac gac aac acc agc cct gag  716
His Phe Ile Ser Glu Pro Tyr Arg Tyr Asp Asn Thr Ser Pro Glu
                140                 145                 150
gat atg cca cag gac ttc aag gct tcc ctg tgc ctg ccc aca acc 761
Asp Met Pro Gln Asp Phe Lys Ala Ser Leu Cys Leu Pro Thr Thr
155 160 165

tcg gcc cca gcc tcg gcc ccc tcc aat ggc aac tgc tca agc tac 806
Ser Ala Pro Ala Ser Ala Pro Ser Asn Gly Asn Cys Ser Ser Tyr
170 175 180

aca gaa acc cag cat ctg agt gtg gtg ggg atc atg ttc gtg gca 851
Thr Glu Thr Gln His Leu Ser Val Val Gly Ile Met Phe Val Ala
185 190 195

cag acc ctg ctg ggc gtg ggc ggg gtg ccc att cag ccc ttt ggc 896
Gln Thr Leu Leu Gly Val Gly Gly Val Pro Ile Gln Pro Phe Gly
200 205 210

atc tcc tac atc gat gac ttt gcc cac aac agc aac tcg ccc ctc 941
Ile Ser Tyr Ile Asp Asp Phe Ala His Asn Ser Asn Ser Pro Leu
215 220 225

tac ctc ggg atc ctg ttt gca gtg acc atg atg ggg cca ggc ctg 986
Tyr Leu Gly Ile Leu Phe Ala Val Thr Met Met Gly Pro Gly Leu
230 235 240

gcc ttt ggg ctg ggc agc ctc atg ctg cgc ctt tat gtg gac att 1031
Ala Phe Gly Leu Gly Ser Leu Met Leu Arg Leu Tyr Val Asp Ile
245 250 255

aac cag atg cca gaa ggt ggt atc agc ctg acc ata aag gac ccc 1076
Asn Gln Met Pro Glu Gly Gly Ile Ser Leu Thr Ile Lys Asp Pro
260 265 270

cga tgg gtg ggt gcc tgg tgg ctg ggt ttc ctc atc gct gcc ggt 1121
Arg Trp Val Gly Ala Trp Trp Leu Gly Phe Leu Ile Ala Ala Gly
275 280 285

gca gtg gcc ctg gct gcc atc ccc tac ttc ttc ttc ccc aag gaa 1166
Ala Val Ala Leu Ala Ala Ile Pro Tyr Phe Phe Phe Pro Lys Glu
290 295 300

atg ccc aag gaa aaa cgt gag ctt cag ttt cgg cga aag gtc tta 1211
Met Pro Lys Glu Lys Arg Glu Leu Gln Phe Arg Arg Lys Val Leu
305 310 315

gca gtc aca gac tca cct gcc agg aag ggc aag gac tct ccc tct 1256
Ala Val Thr Asp Ser Pro Ala Arg Lys Gly Lys Asp Ser Pro Ser
320 325 330

aag cag agc cct ggg gag tcc acg aag aag cag gat ggc cta gtc 1301
Lys Gln Ser Pro Gly Glu Ser Thr Lys Lys Gln Asp Gly Leu Val
335 340 345

cag att gca cca aac ctg act gtg atc cag ttc att aaa gtc ttc 1346
Gln Ile Ala Pro Asn Leu Thr Val Ile Gln Phe Ile Lys Val Phe
350 355 360

ccc agg gtg ctg ctg cag acc cta cgc cac ccc atc ttc ctg ctg 1391
Pro Arg Val Leu Leu Gln Thr Leu Arg His Pro Ile Phe Leu Leu
365 370 375

gtg gtc ctg tcc cag gta tgc ttg tca tcc atg gct gcg ggc atg 1436
Val Val Leu Ser Gln Val Cys Leu Ser Ser Met Ala Ala Gly Met
380 385 390

gcc acc ttc ctg ccc aag ttc ctg gag cgc cag ttt tcc atc aca 1481
Ala Thr Phe Leu Pro Lys Phe Leu Glu Arg Gln Phe Ser Ile Thr
395 400 405

gcc tcc tac gcc aac ctg ctc atc ggc tgc ctc tcc ttc cct tcg 1526
Ala Ser Tyr Ala Asn Leu Leu Ile Gly Cys Leu Ser Phe Pro Ser
410 415 420

gtc atc gtg ggc atc gtg gtg ggt ggc gtc ctg gtc aag cgg ctc 1571
Val Ile Val Gly Ile Val Val Gly Gly Val Leu Val Lys Arg Leu
425 430 435

cac ctg ggc cct gtg gga tgc ggt gcc ctt tgc ctg ctg ggg atg 1616
His Leu Gly Pro Val Gly Cys Gly Ala Leu Cys Leu Leu Gly Met
440 445 450

ctg ctg tgc ctc ttc ttc agc ctg ccg ctc ttc ttt atc ggc tgc 1661
Leu Leu Cys Leu Phe Phe Ser Leu Pro Leu Phe Phe Ile Gly Cys
455 460 465

tcc agc cac cag att gcg ggc atc aca cac cag acc agt gcc cac 1706
Ser Ser His Gln Ile Ala Gly Ile Thr His Gln Thr Ser Ala His
470 475 480

cct ggg ctg gag ctg tct cca agc tgc atg gag gcc tgc tcc tgc 1751
Pro Gly Leu Glu Leu Ser Pro Ser Cys Met Glu Ala Cys Ser Cys
485 490 495

cca ttg gac ggc ttt aac cct gtc tgc gac ccc agc act cgt gtg 1796
Pro Leu Asp Gly Phe Asn Pro Val Cys Asp Pro Ser Thr Arg Val
500 505 510

gaa tac atc aca ccc tgc cac gca ggc tgc tca agc tgg gtg gtc 1841
Glu Tyr Ile Thr Pro Cys His Ala Gly Cys Ser Ser Trp Val Val
515 520 525

cag gat gct ctg gac aac agc cag gtt ttc tac acc aac tgc agc 1886
Gln Asp Ala Leu Asp Asn Ser Gln Val Phe Tyr Thr Asn Cys Ser
530 535 540

tgc gtg gtg gag ggc aac ccc gtg ctg gca gga tcc tgc gac tca 1931
Cys Val Val Glu Gly Asn Pro Val Leu Ala Gly Ser Cys Asp Ser
545 550 555

acg tgc agc cat ctg gtg gtg ccc ttc ctg ctc ctg gtc agc ctg 1976
Thr Cys Ser His Leu Val Val Pro Phe Leu Leu Leu Val Ser Leu
560 565 570

ggc tcg gcc ctg gcc tgt ctc acc cac aca ccc
Gly Ser Ala Leu Ala Cys Leu Thr His Thr Pro Ser Phe Met Leu
575 580 585

atc cta aga gga gtg aag aaa gaa gac aag act ttg gct gtg ggc 2066
Ile Leu Arg Gly Val Lys Lys Glu Asp Lys Thr Leu Ala Val Gly
590 595 600

atc cag ttc atg ttc ctg agg att ttg gcc tgg atg ccc agc ccc 2111
Ile Gln Phe Met Phe Leu Arg Ile Leu Ala Trp Met Pro Ser Pro
605 610 615

gtg atc cac ggc agc gcc atc gac acc acc tgt gtg cac tgg gcc 2156
Val Ile His Gly Ser Ala Ile Asp Thr Thr Cys Val His Trp Ala
620 625 630

ctg agc tgt ggg cgt cga gct gtc tgt cgc tac tac aat aat gac 2201
Leu Ser Cys Gly Arg Arg Ala Val Cys Arg Tyr Tyr Asn Asn Asp
635 640 645

ctg ctc cga aac cgg ttc atc ggc ctc cag ttc ttc ttc aaa aca 2246
Leu Leu Arg Asn Arg Phe Ile Gly Leu Gln Phe Phe Phe Lys Thr
650 655 660

ggt tct gtg atc tgc ttc gcc tta gtt ttg gct gtc ctg agg cag 2291
Gly Ser Val Ile Cys Phe Ala Leu Val Leu Ala Val Leu Arg Gln
665 670 675

cag gac aaa gag gca agg acc aaa gag agc aga tcc agc cct gct 2336
Gln Asp Lys Glu Ala Arg Thr Lys Glu Ser Arg Ser Ser Pro Ala
680 685 690

gta gag cag caa ttg cta gtg tcg ggg cca ggg aag aag cca gag 2381
Val Glu Gln Gln Leu Leu Val Ser Gly Pro Gly Lys Lys Pro Glu
695 700 705

gat tcc cga gtg tga 2396
Asp Ser Arg Val
709

gctgtcttgg ggccccacct ggccaagagt agcagccaca gcagtacctc ctctgagtcc 2456 
tttgcccaag attgggtgtc aagagccctg tgttccattc tggctcctcc actaaattgc 2516
tgtgtgactt caggcaagac attgatcctc tctcagcctt tgcttgctag tctgaaccaa 2576
agagttgttt gggcatttgc tgtgttggcc atttctggag caagagggtc ttcttcctcc 2636
ttcccccagc cagccagctg tcctggggcc aggctttcct gggtggaaag aagtatacct 2696
ttccctgggg ccctaggata gcaaagtgag ccatagtggg ccaggctgcc ctccatgctg 2756
ggccccagcc caggtctgca ctcgcctgga tcaccttctt tgagccttag ccatctcctg 2816
tcaggtagga atgaacttgc cagccttcag gctcgttcag ctatgaccat ctgtgcggtc 2876
agggtacact cagctctcct ccccaactcc agcagccttt aagaagtgtc cctttggcgc 2936
cccctggagg cagagcactg agctggaccc tgggtagact cccacaggga ggacggagct 2996
ggcctcagga gtgggacacc cagacttggc agggcr.ttca agaggcctgt gtgggggccc 3056
caggaatcct tagctgaagc ggggagactc actctccatc tcaggaaatt ctagcccttg 3116
ccctcaggga gccacggttg agggtgaggc ccaacacctg ccttagggcc ctgggtgggc 3176
aagtctgggc cctggggtag ggagggagac tcaggcccac acttgggtat tttctaattt 3236
cagacaaaca cacactcagc gcgcactcac tgattcctac acattgccaa gatttcacac 3296
atgtgaccag gggccaccaa agtccctgtg acctttgtga ctaggatcct aatttctcta 3356
ttttctcctg ggtgcctggg tctgtgtcac ctggggcagt gtggataatg tttagttctg 3416
tgacactgtt ttttgggggt ggcacctggt tctccgatgc ctgggctggt gtcaggccca 3476
ggactgtagt gctgggagca gtaaagctca gctctgtgta atgagtgatg ctatggcttg 3536
ctcgtgtctt atgatccaat ccttttctac atcagccctt gttttgtttt atggctagtc 3596
ttatctggcc tggttatttc cttgcgggga ggagagggtt tgctaatctg ctcccagccc 3656
aacctattac caccccacct cgctgggacc tactgctcgg gaggcagcag acagggagcc 3716
accagcagtg gcttcctggc cctgtgctgg gggtgggggg aagctggggg cacatgtggc 3776
ccttgccttc tgagcagctc ccagtgccag ggctttgaga ctttcccaca tgataaaaga 3836
aaagggaggt acagaagttc caattccctt tttattttgc tggttggtat ctgtaaatgt 3896
ttaataaata tctgagcatg tatctatcaa cgccaagaat ttcaaagtct ccttcaacaa 3956
tatgaggctt ttaggatgtt tatattcctt catccctctt gtttcccagg ttttgcaggg 4016
aaaaaaagtc tggaattata gatacagctt attattaaat ttgttcttgc ataaaaaaaa 4076
aaaaaaa 4083